CDS
Accession Number | TCMCG075C21025 |
gbkey | CDS |
Protein Id | XP_007020340.2 |
Location | join(182822..182829,182924..183031,183503..183625,184021..184122,184211..184376,184759..184875,184982..185049,185171..185272,185352..185568,185716..185952,186066..186193,186266..186383,186472..186690,186788..186943,187162..187238,187330..187538,187856..188072,188166..188245,188357..188474,188568..188738,189095..189263,189334..189377,189476..189518,189598..189743,189964..190058,190175..191076,191249..191530,191636..191731,192228..192406,192543..192624,192914..193038,193630..193722,194598..194751,194838..194879) |
Gene | LOC18593184 |
GeneID | 18593184 |
Organism | Theobroma cacao |
Protein
Length | 1730 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007020278.2 |
Definition | PREDICTED: proteasome activator subunit 4 isoform X2 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGCTCAGATGGGGAAATATCTTAGTCAGATTGCTTAATAAGTATCGAAAGAAATTGTCCTTGAAAGTTCAGTGGCGGCCTTTGTATGATACTCTTATTCATACGCATTTCACAAGGAATACAGGTCCAGAGGGATGGAGATTGAGACAGCGGCATTTTGAGACTGTTACTTCCCTTGTTAGATCATGCCGACGGTTCTTTCCAGCTGGTTCTGCCTCGGAGATTTGGTTCGAGTTTAGATCTCTTTTGGAAAATCCTTGGCACAATGCAACCTTTGAAGGAGCTGGATTTGTGAGACTCTTCCTTCCTACAAATTCAGACAATCAAGACTTCTTCTCAGATAATTGGATTAGAGAGTGTATGGAACTCTGGGACTCAATTCCAAATTGTCAATTCTGGAACGGTCAATGGACTGCTGTTATGGCTCGTGTAGTGAAGAATTACAAGTTTATCAACTGGGAGTGTTTTTTACCTACATTGTTTACTAGATTTTTAAACATGTTTGAGGTTCCTGTGGCAAGTGGAAGTGGATCTTATCCTTTTTCTGTGGACGTACCCAGAAATACAAGGTTCTTGTTCTCCAATAAGACAGTCACTCCAGCAAAGGCCATTGCAAAATCAGTTGTGTATTTATTAAAGCCTGGTAGTATGGCACAAGAACATTTCGAGAAATTGGTCAACCTCTTAGAACAATATTACCATCCCTCAAATGGTGGTCGGTGGACTTATTCGTTGGAGCGATTTCTGCTGTATTTGGTGATTACATTCCAAAAACGCTTACAGCATGAGCAGCAGAACACAGATAATGATAGTCAGGCTGAGCTTTACTTAGGAAAATTAGAAAGGAGTGCATTTGTCAATGTGCTGCTGAGGCTTATTGATCGTGGTCAATATAGCAAAAATGAACATCTTTCTGAGACCGTTGCTGCAGCAACATCGATCTTATCCTATGTGGAGCCCTCTCTGGTACTTCCATTTTTGGCTTCTCGATTCCATATGGCCTTGGAGACGATGACTGCCACCCACCAGTTGAAAACTGCTGTGATGTCAGTAGCATTTGCCGGGCGGTCTCTTTTTTTCACATCCCTATCAAATGGTTCAGTTAACCCGGTTGATCTTGGAGGTGGTGATGATACATTCATTGATCTTCTCATGATTTCATTATCAAATGCACTCCTTGGTATGGATGCCAATGATCCTCCTAAAACCTTGGCAACAATGCAACTAATAGGTTCCATCTTTTCCAATATGGCTATGCTGGATGATAATATAGATGAGCTCTCGTTCATGCCCATGATTCGCTTTTCTGAATGGCTAGATGAATTCTTTTGCCGCCTATTTTCATTACTTCTACATTTGGAACCCAGCAGTGTTCTGAATGAAGGCCTTCATTCATCAGCAACATCAGGAACTTTTCTGGTTGAAGATGGACCATACTACTTTTGCATGCTTGAAATCTTGCTTGGGAGACTTTCAAAACAACTATATAATCAGGCTTTGAAGAAAATCTCCAAATTCGTTTGGACAAATATTCTTCCTGGGGCAATTGCAGAGGTAGGACTGCTTTGTTGCGCATGTGTTCATTCAAATCCAGAAGAGGCGGTTGTTCACCTTGTAGAACCAATTTTATCATCTGTTCTATCCTCTTTGAATGGAACACCTGTTACAGGATTTGGAGGAAGAGGAATTCTGGATCCCTCAGTTTCAACCAAGGCTAAACCCACCCTTTCTCCAGCTCTTGAAACTGCAATTGATTATCAATTAAAAATATTATCAGTTGCCATCAGCTATGGAGGGTCTGCACTTCTCCATTACAAGGATCAATTTAAGGAAGCGATTGTTTCTGCATTTGACTCCCCTTCTTGGAAGGTTAATGGAGCTGGTGATCATCTTCTTCGGTCACTGCTTGGAAGCCTGGTCCTATATTATCCTATGGATCAATACAAGTGCATCTTGAATCACCCTGCTGCTGCTGCATTAGAGGAATGGATCAGC
ACAAAAGATTATTCTAATGATGGAGCACTGAAGGCCCCTAAATGGCATATTCCAAGTGATGAAGAAGTTCAATTTGCTAATGAACTTTTAATTCTCCATTTTCAATCGGCTTTAGATGATCTTTTAAGAATATGCCAAACTAAGATCCACTCTGATCCAGGCAACGAGAAAGAGCACTTGAAAGTGACTCTTTTACGTATTGATTCTTCATTGCAAGGTGTATTATCTTGCTTGCCTGATTTCAGGCCATCTTCCAGGAACGGCACGATTGAAGACTCTAGTTATCCTTCTTTTCTAATAGCTGGAGCTACAGGTTCAAGAGTTGGCAGCAATCAACTGCGGGAAAAGGCTGCTGAGGTTATACACACTGCCTGCAAATACTTACTAGAGGAAAAATCAGATGACAGCATTTTATTGATTCTCATTATACGTATCATGGATGCTCTTGGAAACTACGGAAGTTTGGAATATGACGAGTGGTCAAATCATAGGCAGGCTTGGAAGTTGGAATCTGCTGCCATTGTAGAGCCTCCAATAAATTTTATAGCATCTTCACATTCTAAAGGAAAGAGAAGGCCTAGGTGGGCTCTCATTGACAAGGCATACATGCACAGCACATGGAGATCTTCTCAATCATCTTATCATCTGTTTCGTACCAATGGAAATTTCTTGCCACCAGACCATGTAATTTTGTTGATGGATGATCTTTTAAATCTTTCTTTGCATAACTATGAAAGCGTTCGCATGCTTGCTGGAAAATCTCTGTTGAAGATAATGAAGAGGTGGCCATCTTTGATTTCAAAGTGTGTGCTCTCTCTGTGTGAGAATTTGAGGAAACCTAATTCACCGGACCATGCGGTTCTAGGTTCTTGTGCTGTGCTTTCTACACAGACAGTTCTGAAGCATTTGACAACGGATCCACAAGCATTTGGTTCATTTCTCCTTGCAATTCTTTTAAGCTCCCATCATGAATCACTGAAAGCCCAGAAAGCAATCAATGAGCTTTTTGTCAAATACAACATCTACTTTGCGGGTGTGTCTAAAAACATCTTTAAGACAGTGGATAATCACATAGATACCCCAGACTTTGCAGATCTGGTGTCTCAGATTGGTTCAATGAGTTTTGATTCTACGGGTTTGCATTGGCGGTATAATCTGATGGCTAACAGAGTTTTGCTCTTGTTGGCCGTGTCATGTAGGCATGACCCAAACTTTTCACCAAAAATCCTTGGTGAAACTGCTGGACACTTCCTAAAGAACTTGAAAAGTCAACTTCCTCAGACAAGAATACTTGCAATCTCGGCTCTAAATACGCTATTAAAAGATTCACCTTATAAGATGTCAGCTGATGATCGACCACTATTCTCTGGGAATTCACAAGAAAATGCCGAATCATCCCTTGAAGGAGCATTAAGGGAGATATTTCAGGAAGAGGGATTTTTTAATGAGACCTTAAATAGTTTGTCCCATGTCCATATAATAACTGATACTGAGAGTGCATCTTCTAGAGGAAATCATGGAAATTCTTCCTTTCAGAGCTTGGCTGACAAATCGATCACCCGTTTTTATTTCGACTTTTCAGCTACATGGCCACGTACTCCTAGTTGGATCTCTTTATTAGGAAGCGATACTTTTTACTCAAACTTTGCTCGTATATTTAAGCGGTTAATCCAAGAATGTGGAATGCCGGTTTTACTTGCACTGAAAAGTACATTGGAGGAGTTTGTCAATGCCAAGGAGAGGTCTAAGCAGTGTGTCGCTGCTGAAGCATTTGCTGGAGTGTTACATTCTGATGTCAATGGCCTTTTAGAGGAATGGGACAGCTGGATGATGGTCCAGTTGCAGAACATTATTCTTGCTCAATCGGTGGAATCCATTCCTGAGTGGGCAGCTTGTATACGTTATGCAGTTACAGGAAAAGGAAAGCATGGAACAAGAGTTCCCCTTCTGAGGCAACAGATTTTGAACTGCTTGTTGACACCTTTACCTCCAACTGTAAC
TACAACTGTAGTTGCGAAGCGGTATGCTTTTATTTCTGCTGCACTTATAGAGCTATCCCCGCAAAAAATGCCTGTGCCTGAGATACAGATGCACAATAAACTTCTGGATGAATTGCTGGGTAATATGTGCCATTCATCGGCCCAAGTAAGGGAAGCTATTGGGGTTACCCTTTCTGTGTTGTGCTCTAACATTCGGCTCCATGCGTCATCTTCGCAAGATCATTCGAATGACAGGGGAAAGACTAATATCAATAACCAACTTAAGGAGGAAAATTGGGTTCAACTACTAACGGAAAGAGCATCCGAACTTGTTGTGAACATTCAGAATTCTAGCCTGTCTGATGTTATAGATACCTCGACAGATATAAGTACCAAAAATGGTTATCAGAATGGTGATTCACAGGATGATGTCAAATGGATGGAAACTTTATTTCATTTTATCATATCAACTTTGAAGTCTGGAAGATCTTCATATTTGCTTGACGTGATTGTGGGGCTTCTATATCCTGTAATTTCCTTGCAGGAAACGTCAAACAAAGATTTGTCAACGTTAGCAAAGGCAGCATTTGAATTACTAAAATGGAGAATCATTTTGGAACCCCATCTCCAGAAGGCTGTTTCTGTTATTCTTTCTTCTGCAAAGGATCCTAACTGGCGAACTAGATCAGCAACTCTAACATATCTACGAACTTTTATGTTCAGGCACACCTTCATTCTCTTGAAAGGGGACAAACAAAAGATCTGGAAAACAGTGGAGAAGCTACTTCAAGACAACCAAGTGGAGGTAAGAGAGCATGCTGCAGGGGTGCTAGCTGGCCTAATGAAGGGTGGGGATGAAGATTTAGCTGGAGATTTCCGTGATAGGGCATACATAGAGGCAAATTCCATTCAAAGAAGGAGAAAGACAAGGAATGCAAATTCTGGACACTCTGTGGCATCTGTACATGGTGCAGTACTTGCTCTGGCAGCTTCGGTGTTATCAGTCCCATATGATATGCCCAGATGGTTACCTGATCACGTTACATTACTGGCTCGCTTCAGTGGGGAGCCATCACCTGTAAAATTGACTGTGACAAAAGCAGTTGCTGAGTTCCGGCGTACGCATGCAGATACATGGAACGTTCAAAAGGATTCGTTTAATGAAGAGCAACTTGAGGTCCTGGCAGATACATCGTCCTCATCGTCATATTTTGCTTGA |
Protein: MLRWGNILVRLLNKYRKKLSLKVQWRPLYDTLIHTHFTRNTGPEGWRLRQRHFETVTSLVRSCRRFFPAGSASEIWFEFRSLLENPWHNATFEGAGFVRLFLPTNSDNQDFFSDNWIRECMELWDSIPNCQFWNGQWTAVMARVVKNYKFINWECFLPTLFTRFLNMFEVPVASGSGSYPFSVDVPRNTRFLFSNKTVTPAKAIAKSVVYLLKPGSMAQEHFEKLVNLLEQYYHPSNGGRWTYSLERFLLYLVITFQKRLQHEQQNTDNDSQAELYLGKLERSAFVNVLLRLIDRGQYSKNEHLSETVAAATSILSYVEPSLVLPFLASRFHMALETMTATHQLKTAVMSVAFAGRSLFFTSLSNGSVNPVDLGGGDDTFIDLLMISLSNALLGMDANDPPKTLATMQLIGSIFSNMAMLDDNIDELSFMPMIRFSEWLDEFFCRLFSLLLHLEPSSVLNEGLHSSATSGTFLVEDGPYYFCMLEILLGRLSKQLYNQALKKISKFVWTNILPGAIAEVGLLCCACVHSNPEEAVVHLVEPILSSVLSSLNGTPVTGFGGRGILDPSVSTKAKPTLSPALETAIDYQLKILSVAISYGGSALLHYKDQFKEAIVSAFDSPSWKVNGAGDHLLRSLLGSLVLYYPMDQYKCILNHPAAAALEEWISTKDYSNDGALKAPKWHIPSDEEVQFANELLILHFQSALDDLLRICQTKIHSDPGNEKEHLKVTLLRIDSSLQGVLSCLPDFRPSSRNGTIEDSSYPSFLIAGATGSRVGSNQLREKAAEVIHTACKYLLEEKSDDSILLILIIRIMDALGNYGSLEYDEWSNHRQAWKLESAAIVEPPINFIASSHSKGKRRPRWALIDKAYMHSTWRSSQSSYHLFRTNGNFLPPDHVILLMDDLLNLSLHNYESVRMLAGKSLLKIMKRWPSLISKCVLSLCENLRKPNSPDHAVLGSCAVLSTQTVLKHLTTDPQAFGSFLLAILLSSHHESLKAQKAINELFVKYNIYFAGVSKNIFKTVDNHIDTPDFADLVSQIGSMSFDSTGLHWRYNLMANRVLLLLAVSCRHDPNFSPKILGETAGHFLKNLKSQLPQTRILAISALNTLLKDSPYKMSADDRPLFSGNSQENAESSLEGALREIFQEEGFFNETLNSLSHVHIITDTESASSRGNHGNSSFQSLADKSITRFYFDFSATWPRTPSWISLLGSDTFYSNFARIFKRLIQECGMPVLLALKSTLEEFVNAKERSKQCVAAEAFAGVLHSDVNGLLEEWDSWMMVQLQNIILAQSVESIPEWAACIRYAVTGKGKHGTRVPLLRQQILNCLLTPLPPTVTTTVVAKRYAFISAALIELSPQKMPVPEIQMHNKLLDELLGNMCHSSAQVREAIGVTLSVLCSNIRLHASSSQDHSNDRGKTNINNQLKEENWVQLLTERASELVVNIQNSSLSDVIDTSTDISTKNGYQNGDSQDDVKWMETLFHFIISTLKSGRSSYLLDVIVGLLYPVISLQETSNKDLSTLAKAAFELLKWRIILEPHLQKAVSVILSSAKDPNWRTRSATLTYLRTFMFRHTFILLKGDKQKIWKTVEKLLQDNQVEVREHAAGVLAGLMKGGDEDLAGDFRDRAYIEANSIQRRRKTRNANSGHSVASVHGAVLALAASVLSVPYDMPRWLPDHVTLLARFSGEPSPVKLTVTKAVAEFRRTHADTWNVQKDSFNEEQLEVLADTSSSSSYFA |
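The Location field and the protein Length can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the `join(...)` segments are 1-based inclusive coordinates (the usual GenBank feature-location convention) and that the CDS ends with a single stop codon. The names `parse_join` and `cds_length` are hypothetical helpers introduced here for illustration.

```python
import re

# Location string copied verbatim from the CDS block of this record.
LOCATION = (
    "join(182822..182829,182924..183031,183503..183625,184021..184122,"
    "184211..184376,184759..184875,184982..185049,185171..185272,"
    "185352..185568,185716..185952,186066..186193,186266..186383,"
    "186472..186690,186788..186943,187162..187238,187330..187538,"
    "187856..188072,188166..188245,188357..188474,188568..188738,"
    "189095..189263,189334..189377,189476..189518,189598..189743,"
    "189964..190058,190175..191076,191249..191530,191636..191731,"
    "192228..192406,192543..192624,192914..193038,193630..193722,"
    "194598..194751,194838..194879)"
)

# "Length" field of the Protein block, in amino acids.
PROTEIN_LENGTH_AA = 1730


def parse_join(location):
    """Return (start, end) pairs from a join(...) location string.

    Assumes plain start..end segments with 1-based inclusive coordinates.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(segments):
    """Total number of nucleotides covered by the segments."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
total_nt = cds_length(segments)

# A complete CDS is a whole number of codons; the last codon is the stop.
codons, remainder = divmod(total_nt, 3)
print(f"segments: {len(segments)}")
print(f"CDS length: {total_nt} nt ({codons} codons, remainder {remainder})")
print(f"matches protein length: {codons - 1 == PROTEIN_LENGTH_AA}")
```

For this record the 34 segments add up to 5193 nt, which is 1731 codons: 1730 amino acids plus the terminal stop codon (TGA), consistent with the Length field.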